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This invention relates to improved methods for sequencing and genotyping nucleic acid in a single molecule configuration. The 
method involves single molecule detection of fluorescent labeled PPi moieties released from NTPs as a polymerase extension product is 
created. 
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A SYSTEM AND METHODS FOR NUCLEIC ACID SEQUENCING OF SINGLE 
MOLECULES BY POLYMERASE SYNTHESIS 

CROSS-REFERENCE TO RELATED APPLICATIONS 
This application claims priority to U.S. Provisional Patent Application 
Serial Numbers 60/1 12,078, filed December 14, 1998, and 60/1 15,496, filed January 11, 
1 999, the disclosures of which are hereby incorporated by reference in their entirely for 
all purposes. 

STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER FEDERALLY 
SPONSORED RESEARCH AND DEVELOPMENT 
[ Not Applicable ] 

FIELD OF THE INVENTION 

This invention relates to improved methods for sequencing and genotyping 
nucleic acid in a single molecule configuration. The method involves single molecule 
detection of fluorescent labeled pyrophosphate [PPi] moieties released from nucleotide 
triphosphate [NTPs] as a polymerase extension product is created. 

BACKGROUND OF THE INVENTION 

Previous non-electrophoretic sequencing and genotyping methods have not 
taken advantage of the continuous detection of PPi release from incorporated nucleotides. 
In the prior art of PPi sequencing (www.pyrosequencing.com), each nucleotide 
Adenosine-5' phosphate "A", Cytidine 5'-phosphate "C", Guanosine 5'-phosphate, 
Uridine 5'-phosphate "U" and Thymidine 5'-phosphate "T", is added separately to a 
reaction mixture containing the nucleic acid target and a polymerase. The current 
nucleotide is removed before the next is added. Incorporation of a nucleotide is 
accompanied by release of PPi from the NTP, detected by a secondary assay for PPi. A 
secondary assay is required because the PPi moiety of the NTP is not labeled. Only one 
nucleotide can be tested per cycle; if all 4 NTPs were present simultaneously, the 
polymerization reaction would continue uncontrolled and no sequence information could 
be obtained. Read length is limited by loss of synchronization among the target nucleic 
acid molecules in the sample. 
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Other non-electrophoretic methods, such as the stepwise ligation and 
cleavage of probes on DNA fragments attached to microbeads, requires synchronization 
of the DNA molecules that inevitably decays with each cycle. 

The present method solves these problems and has advantages over other 
5 sequencing methods. Stepwise addition of nucleotides is unnecessary, as all four 

nucleotides are added simultaneously. Sequence information is produced continuously as 
polymerases continually incorporate all four nucleotides into growing nucleic acid [NA] 
chains. There is no loss of synchronization because single molecules are observed 
separately. Analysis of single molecules also allows for the use of NA fragments taken 
1 0 directly from organisms. With the present method, it is no longer necessary to prepare 
NA for sequencing using cloning or amplification processes, although NA so prepared 
can still be sequenced. In addition, there is the possibility of sequencing and genotyping 
many different nucleic acids on a single surface simultaneously. 

BRIEF DESCRIPTION OF THE FIGURES 

15 Figure 1 is the chemical structure of quenching moieties connected 

through a linker to uracil. 

Figure 2 describes a dye linked to the gamma phosphate of a nucleotide 
also bearing a quenching moiety. 

Figure 3 is an overview of a synthetic pathway for generating nucleotides 
20 with fluorophores on the gamma phosphate and the quenching moiety on the base. 

Figure 4 is an overview of the system for polymerase synthesis sequencing 
of target nucleic acid. 

Figure 5 is a schematic view of the method where a DNA polymerase is 
immobilized onto a solid support and the target nucleic acid is DNA. 
25 Figure 6 is a schematic view of the method where a RNA polymerase is 

immobilized onto a solid support and the target nucleic acid is double stranded DNA. 

SUMMARY OF THE INVENTION 
This invention provides for a method of genotyping or sequencing a target 
nucleic acid [NA] comprising the steps of: i. immobilizing a nucleic acid polymerase or 
30 the target nucleic acid onto a solid support in a single molecule configuration; ii. 

contacting the solid support with a solution containing: (a) a target nucleic acid where the 
polymerase is immobilized or a polymerase where the target nucleic acid is immobilized; 
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(b) a primer nucleic acid which complements a region of the target nucleic acid 
downstream of the region to be sequenced; (c) NTP where each type of base is 
differently labeled on the phosphate portion, where the labels provide a unique signal that 
is selectively detectable upon incorporation of NTP into the polymerase extension 
5 product; iii. permitting the polymerase to sequentially extend the primer incorporating the 
NTP creating a complement to the target nucleic acid; and, iv. determining extension of 
the primer by detecting the unique signal from the labeled NTP to genotype or to 
sequence the target nucleic acid. The method may use a solution which contains at least 
two different types of NTP. 

10 It is preferred that the NTPs are labeled on the gamma phosphate with a 

fluorescent label that differentially fluoresces when the gamma phosphate is cleaved 
away from the nucleoside. The immobilized moieties can further comprise an array of 
locations each bearing a single molecule of polymerase or target nucleic acid. The 
immobilized moiety can be a polymerase positioned as an array of individual, single 

15 molecules onto a solid support. For genotyping the target nucleic acid may be a single 
nucleotide polymorphism. In such circumstances one need only add a solution which has 
a single type of NTP. For sequencing the detection involves a sequential detection of 
different types of NTP to provide the sequence of the target nucleic acid. In a preferred 
embodiment, the dNTP is labeled with a fluorophore on the gamma phosphate and a 

20 quencher moiety. In another preferred embodiment the solution contacting the solid 
support flows past the immobilized polymerase or target NA. The polymerases can be 
DNA dependent or RNA dependent DNA polymerases or DNA dependent RNA 
polymerases. The NTPs can be ribonucleotide triphosphates [rNTPs] or deoxynucleotide 
triphosphates [dNTPs] depending on the target nucleic acid and the polymerase in use. 

25 This invention further includes a system for genotyping or sequencing a 

target NA comprising: i. a solid support having a surface bearing an immobilized nucleic 
acid polymerase or the target NA in a single molecule configuration; ii. a solution for 
contacting the surface containing: (a) a target NA where the polymerase is immobilized 
or a polymerase where the target NA is immobilized; (b) where the polymerase requires 

30 the use of a primer nucleic acid there is added a primer which complements a region of 
the target NA downstream of the region to be sequenced; (c) a molar excess of NTP, 
where each type of NTP base is differently labeled on the phosphate portion and where 
the labels provide a unique signal that is selectively detectable upon incorporation of the 
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NTP into the polymerase extension product. The system embraces the same 
embodiments identified above for the methods. Where the system involves flowing 
solutions, the force for the flow may be mechanically generated or electroosmotically 
generated using electrodes. 

5 This invention further provides for a solid support having a surface where 

the surface has a nucleic acid polymerase array attached to it wherein the members of the 
array consist of individual molecules of polymerase. The array members are optionally 
addressed so that the locations are defined and comparative information between sites can 
be generated and recorded by the optical reader. The solid support may have DNA 
1 0 dependent DNA polymerase or a DNA dependent RNA polymerase or a RNA dependent 
DNA polymerase (reverse transcriptase) attached to it. Preferably the immobilized 
moieties are in a single molecule configuration. 

DEFINITIONS 

"Addressable" in the context of an array refers to members of the array 
1 5 located in discrete and defined regions. 

"Array" refers to a solid support having more than one site or location 
having either a target nucleic acid or a nucleic acid polymerase bound thereto. 

"Complements a region of the target nucleic acid downstream of the 
region to be sequenced" in the context of sequencing or genotyping refers to the fact that 
20 the primers are extended in a 3 ' direction by a polymerase. Therefore the primer binds to 
a subsequence of the target 3' (downstream) to the target sequence that is to be 
determined as the 3 ' end of the primer is extended. 

"Genotyping" is a determination of allelic content of a target DNA without 
necessarily determining the sequence content of the entire DNA. It is a subset of 
25 sequencing. For example the identification of single nucleotide polymorphisms by 

determination of single base differences between two known forms of an allele is a form 
of sequencing that does not require all the target DNA to be sequenced. 

"Immobilizing" refers to the attachment of a target nucleic acid or 
polymerase to a solid support by a means that prevents its release in a reaction solution. 
30 The means can be covalent bonding or ionic bonding or hydrophobic bonding. 

"Optical reader" refers to a device that can detect and record light emitted 
from the labeled dNTP. 
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"Sequencing" refers to the determination of the order and position of bases 
in a Nucleic acid. 

"Single molecule configuration" refers to an array of molecules on a solid 
support where members of the array are present as an individual molecule located in a 
5 defined location. The members can be the same or different. 

"Type of base" in the context of nucleotide triphosphates [NTPs] refers to 
nucleotides able to be recognized as a substrate for the polymerase. Typical bases include 
adenine, cytosine, guanine, uracil, or thymine bases where the type refers to the 
subpopulation of nucleotides having that base within a population of NTPs bearing 
10 different bases. Other rarer bases or analogs can be substituted such as xanthine or 
hypoxanthine or methylated cytosine. 

DETAILED DESCRIPTION 

1. Introduction. 

This invention provides for a novel means to genotype and sequence a 
1 5 nucleic acid. The method described herein uses individual fluorogenic NTP molecules 
that are identified one at a time as a RNA or DNA dependent polymerase incorporates 
their respective NTP into the extension products. The NTPs carry two adducts: a 
fluorescent dye attached to the gamma phosphate and a fluorescence quencher attached to 
either the base or sugar or dye. When the quencher is attached to the base or sugar, the 
20 NTP is hydrolyzed upon incorporation into the extension product and the liberated 

pyrophosphate-dye moiety becomes fluorescent. The free dye now unquenched, becomes 
fluorescent and its appearance is imaged at video-rate under a microscope. 

A flowing stream sweeps the dye away from the parent molecule. All four 
NTPS are present simultaneously. As the polymerase continues to move along the target 
25 nucleic acid, the nucleotide sequence is read from the order of the released dyes. 

2. Sources of target nucleic acid. 

The target nucleic acid is not critical and can come from a variety of 
standard sources. It can be mRNA, ribosomal RNA, genomic DNA or cDNA. When the 
target is from a biological source, there are a variety of known procedures for extracting 
30 nucleic acid and optionally amplified to a concentration convenient for genotyping or 

sequence work. Nucleic acid can be obtained from any living cell of a person, animal or 
plant. Humans, pathogenic microbes and viruses are particularly interesting sources. 
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Nucleic acid amplification methods are also known. Preferably, the 
amplification is carried out by polymerase chain reaction ( PCR) (U.S. Pat. Nos. 
4,683,202. 4,683,195 and 4,889,818; Gyllenstein et al., 1988, Proc. Natl. Acad. Sci. USA 
85: 7652-7656; Ochman et al., 1988, Genetics 120: 621-623; Loh et al., 1989, Science 
5 243 : 2 1 7-220; Innis et al., 1 990, PCR 

Protocols, Academic Press, Inc., San Diego, Calif.). Other amplification 
methods known in the art can be used, including but not limited to ligase chain reaction 
(see EP 320,308) use of Q- beta replicase, or methods listed in Kricka et al., 1995, 
Molecular Probing, Blotting, and Sequencing, Chap. 1 and Table IX, Academic Press, 
10 New York. 

3. Immobilization 

The single molecule arrays of use in this invention involve a support, a 
bioreactive or bioadhesive layer and a bioresistant layer. The support can be glass, silica, 
plastic or any other conventionally non-reactive material that will not create significant 

15 noise or background for the fluorescent detection methods. The bioadhesive layer can be 
an ionic adsorbent material such as gold, nickel, or copper (Montemagno and Bachand 
(1999) Constructing nanomechanical devices powered by biomolecular motors. 
Nanotechnology 10: 225-231), protein-adsorbing plastics such as polystyrene (US Pat. 
No. 5,858,801), or a covalent reactant such as a thiol group. To create a patterned array 

20 of the bioadhesive layer, an electron-sensitive polymer such as polymethyl methacrylate 
(PMMA) coated onto the support can be etched in any desired pattern using an electron 
beam followed by development to remove the sensitized polymer. The holes in the 
polymer are then coated with a metal such as nickel and the polymer is removed with a 
solvent, leaving a pattern of metal posts on the substrate. This method of electron beam 

25 lithography provides the very high spatial resolution and small feature size required to 
immobilize just one molecule at each point in the patterned array. A second means for 
creating high-resolution patterned arrays is atomic force microscopy. A third means is X- 
ray lithography. 

The biologies can be attached to the bioadhesive pattern by providing a 
30 polyhistidine tag on the biologic that binds to metal bioadhesive patterns. Other 

conventional means for attachment employ homobifunctional and heterobifunctional 
crosslinking reagents. Homobifunctional reagents carry two identical functional groups, 
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whereas heterobifunctional reagents contain two dissimilar functional groups to link the 
biologies to the bioadhesive. A vast majority of the heterobifunctional cross-linking 
agents contain a primary amine-reactive group and a thiol-reactive group. Covalent 
crosslinking agents are selected from reagents capable of forming disulfide (S-S), glycol 
5 (-CH(OH)-CH(OH)-), azo (-N=N-), sulfone (-S(=0 2 -), ester (-C(=0)-0-), or amide (- 
C(=0)-N-) bridges. 

A bioresist layer may be placed or superimposed upon the bioadhesive 
layer either before or after attachment of the biologic to the bioadhesive layer. The 
bioresist layer is any material that does not bind the biologic. Examples include bovine 
10 serum albumin, gelatin, lysozyme, octoxynol, polysorbate 20 (polyethenesorbitan 

monolaurate) and polyethylene oxide containing block copolymers and surfactants (US 
Pat. No. 5,858,801). Deposition of the layers is done by conventional means, including 
spraying, immersion and evaporative deposition (metals). 

4. Labeling of NTPs 

15 A. Attachment of a y-Phosphate Fluorophore 

The methods of the present invention involve detecting and identifying 
individual fluorogenic dNTP molecules as a polymerase incorporates them into a single 
nucleic acid molecule. In certain aspects, a fluorescent dye is attached to the y-phosphate 
and a quencher is attached to the nucleobase. As such, the present invention provides a 

20 nucleotide triphosphate (NTP) probe, comprising: a NTP having a y-phosphate with a 
fluorophore moiety attached thereto; a quencher moiety sufficiently proximal to the 
fluorophore moiety to prevent fluorescence of the fluorophore moiety; wherein the 
fluorophore moiety exists quenched with at least about a 5 fold quenching efficiency, 
preferably, at least a 10 fold quenching efficiency, when the y-phosphate is attached to the 

25 NTP and unquenched when the y-phosphate is detached from the NTP. 

In preferred aspect, the NTP probe is a dNTP probe having a fluorescent 
dye attached to the y-phosphate moiety and a quencher attached to the nucleobase. 
Suitable nucleobases include, but are not limited to, adenine, guanine, cytosine, uracil, 
thymine, deazaadenine and deazaguanosine. The quenched dNTPs are non-fluorescent 

30 when the y-phosphate is attached to the NTP, and thereafter become fluorescent when the 
Y-phosphate is unattached to the NTP. 
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B. Fluorescence Quenching 

In single molecule detection, high quenching efficiency is advantageous as 
it reduce fluorescence background, thus permitting the use of higher nucleotide 
concentrations. Several quenching mechanisms exist (see, for example, G.G. Guilbault, 
5 Practical Fluorescence, Marcel Dekker, New York, 1973) . In certain instances, the 
quenching depends on spectral overlap between fluorophore and quencher, and it 
functions at long range (fluorescence resonance energy transfer, FRET). In other 
instances, the fluorophore and quencher interact between molecular orbitals and require 
contact between fluorophore and quencher e.g. electron transfer mechanisms. In still 

10 other instances, a ground-state complex quenching mechanism can occur. All such 
quenching mechanisms are within the scope of the present invention. 

In certain aspects, the fluorophore moiety are fluorescent organic dyes 
derivatized for attachment to y-phosphate directly or via a linker. Preferably, quencher 
moieties are also organic dyes, which may or may not be fluorescent, depending on the 

15 particular embodiment of the invention. For example, in one embodiment of the present 
invention, the fluorophore and the quencher are both fluorescent. In this embodiment, a 
fluorescent energy transfer mechanism can be used wherein the first fluorophore (e.g. 
fluorescein) is excited and emission is read from the second fluorophore (e.g. rhodamine). 
In these systems, dequenching is accomplished by hydrolysis of the fluorophore attached 

20 to the y-phosphate. 

In another embodiment, the fluorophore and quencher function by an 
electron transfer mechanism. In this aspect, a non- fluorescent quencher e.g. DABCYL or 
dinitrophenyl (see, Figure 1) absorbs energy from an excited fluorophore, but does not 
release the energy radiatively. These quenchers can be referred to as chromogenic 

25 molecules. 

There is a great deal of practical guidance available in the literature for 
providing an exhaustive list of fluorescent and chromogenic molecules and their relevant 
optical properties (see, for example, Berlman, Handbook of Fluorescence Spectra of 
Aromatic Molecules, 2nd Edition (Academic Press, New York, 1971); Griffiths, Colour 
30 and Constitution of Organic Molecules (Academic Press, New York, 1976); Bishop, Ed., 
Indicators (Pergamon Press, Oxford, 1972); Haugland, Handbook of Fluorescent Probes 
and Research Chemicals (Molecular Probes, Eugene, 1992) Pringshetm, Fluorescence 
and Phosphorescence (Interscience Publishers, New York, 1949); and the like. Further, 
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there is extensive guidance in the literature for derivatizing fiuorophore and quencher 
molecules for covalent attachment via common reactive groups that can be added to a 
nucleotide, as exemplified by the following references: Haugland (supra); Ullman et al., 
U.S. Pat. No. 3,996,345; Khanna et al, U.S. Pat. No. 4,351,760. 
5 Suitable donors and acceptors operating on the principle of fluorescence 

energy transfer (FET) include, but are not limited to, 4-acetamido-4'- 
isothiocyanatostilbene-2,2'disulfonic acid; acridine and derivatives: acridine, acridine 
isothiocyanate; 5-(2'-aminoethyl)aminonaphthalene-l -sulfonic acid (EDANS); 4-amino- 
N-[3-vinylsulfonyl)phenyl]naphthalimide-3,5 disulfonate; N-(4-anilino-l - 

10 naphthyl)maleimide; anthranilamide; BODIPY; Brilliant Yellow; coumarin and 

derivatives: coumarin, 7-amino-4-methylcoumarin (AMC, Coumarin 120),7-amino-4- 
trifluoromethylcouluarin (Coumaran 151); cyanine dyes; cyanosine; 4\6-diaminidino-2- 
phenylindole (DAPI); 5', 5"-dibromopyrogallol-sulfonaphthalein (Bromopyrogallol Red); 
7-diethylammo-3-(4'-isothiocyanatophenyl)-4-methylcoumarin; diethylenetriamine 

15 pentaacetate; 4,4'-diisothiocyanatodihydro-stilbene-2,2'-disulfonic acid; 4,4'- 

diisothiocyanatostilbene-2,2 ' -disulfonic acid; 5-[dimethy laminojnaphthalene- 1 -sulfonyl 
chloride (DNS, dansylchloride); 4-dimethylaminophenylazophenyl-4' -isothiocyanate 
(DABITC); eosin and derivatives: eosin, eosin isothiocyanate, erythrosin and derivatives: 
erythrosin B, erythrosin, isothiocyanate; ethidium: fluorescein and derivatives: 5- 

20 carboxyfluorescein (FAM),5-(4,6-dichlorotriazin-2-yl)aminofluorescein (DTAF), 2',7'- 
dimethoxy-4' 5 '-dichloro-6-carboxy fluorescein (JOE), fluorescein, fluorescein 
isothiocyanate, QFITC, (XRJTC); fluorescamine; IR144; IR1446; Malachite Green 
isothiocyanate; 4-methylumbelliferoneortho cresolphthalein; nitrotyrosine; pararosaniline; 
Phenol Red; B-phycoerythrin; o-phthaldialdehyde; pyrene and derivatives: pyrene, pyrene 

25 butyrate, succinimidyl 1 -pyrene; butyrate quantum dots; Reactive Red 4 (Cibacron™ 
Brilliant Red 3B-A) rhodamine and derivatives: 6-carboxy-X-rhodamine (ROX), 6- 
carboxyrhodamine (R6G), lissamine rhodamine B sulfonyl chloride rhodamine (Rhod), 
rhodamine B, rhodamine 123, rhodamine X isothiocyanate, sulforhodamine B, 
sulforhodamine 101, sulfonyl chloride derivative of sulforhodamine 101 (Texas Red); 

30 N,N,N',N'-tetramethyl-6-carboxyrhodamine (TAMRA); tetramethyl rhodamine; 

tetramethyl rhodamine isothiocyanate (TRITC); riboflavin; rosolic acid; terbium chelate 
derivatives; Cy 3; Cy 5; Cy 5.5; Cy 7; IRD 700; IRD 800; La Jolla Blue; phthalo cyanine; 
and naphthalo cyanine. 
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Preferred fluorophore-quencher pairs include, but are not limited to, 
xanthene dyes, including fluoresceins, and rhodamine dyes. Many suitable forms of these 
compounds are widely available commercially with substituents on their phenyl moieties 
which can be used as the site for bonding or as the bonding functionality for attachment to 
5 the y-phosphate or nucleobase. Another group of fluorescent compounds are the 

naphthylamines, having an amino group in the alpha or beta position. Included among 
such naphthylamino compounds are l-dimethylaminonaphthyl-5-sulfonate, l-anilino-8- 
naphthalene sulfonate and 2-p-toluidinyl-6-naphthalene sulfonate. Other dyes include 3- 
phenyl-7-isocyanatocoumarin, acridines, such as 9-isothiocyanatoacridine and acridine 
10 orange; N-(p-(2-benzoxazolyl)phenyl)maleimide; benzoxadiazoles, stilbenes, pyrenes, 
and the like. 

Preferably, the fluorophore/quencher pair are selected from fluorescein 
and rhodamine dyes. These dyes and appropriate linking methodologies for attachment to 
nucleotides are described in many references, (see, Khanna et al. (cited above); Marshall, 
15 Histochemical J., 7:299-303 (1975); Menchen et al. , U.S. Patent No. 5,1 88,934; Menchen 
et al., European Patent Application 873 10256.0; and U.S. Patent No. 5,366,860, issued to 
Bergot et al. ). 

In a preferred embodiment, the quencher 4-(4'-dimethylaminophenylazo)- 
benzoic acid (DABCYL) is used. The quencher DABCYL (see, Figure 1) is commonly 

20 used in fluorogenic probes to detect DNA hybridization (molecular beacons) or protease 
activity. DABCYL quenches fluorescence from a wide variety of dyes emitting between 
475 nm and 805 ran, with measured efficiencies ranging from 90 to 99.9% (see, S. Tyagi 
et al.,Nat. Biotechnol. 16, 49 (1998); and G.T. Wang et al, Tetrahedron Lett. 31, 6493 
(1990)). Without being bound by any particular theory, it is believed that the quenching 

25 mechanism of DABCYL probably involves electron transfer, rather than fluorescence 

resonance energy transfer, because it is wavelength-independent. In an equally preferred 
embodiment, the quenchers dinitrophenyl (DNP) or trinitrophenyl (TNP) are used. 

Quenching efficiency as measured in any particular experiment depends 
on the purity of the dye-quencher pair (contaminating free dye or cleaved molecules 

30 fluoresce); the electronic coupling and physical distance between dye and quencher 

(closer is usually better); and the excited-state lifetime of the dye (the longer the time, the 
greater the chances for electron transfer). 



10 



WO 00/36152 PCT/US99/29585 

In certain embodiments, certain visible and near IR dyes are known to be 
sufficiently fluorescent and photostable to be detected as single molecules. In this aspect 
the visible dye, BODIPY R6G (525/545), and a larger dye, LI-COR's near-infrared dye, 
IRD-38 (780/810) can be detected with single-molecule sensitivity and are used to 
5 practice the present invention. 

There are many linking moieties and methodologies for attaching 
fluorophore or quencher moieties to nucleotides, as exemplified by the following 
references: Eckstein, editor, Oligonucleotides and Analogues: A Practical Approach (IRL 
Press, Oxford, 1991); Zuckerman et al. Nucleic Acids Research, 15: 5305-5321 (1987) 

1 0 (3' thiol group on oligonucleotide); Sharma et al. , Nucleic Acids Research, 19: 30 1 9 

(1991) (3' sulfhydryl); Giusti etal, PCR Methods and Applications, 2: 223-227 (1993) 
and Fung et al., U.S. Pat. No. 4,757,141 (5' phosphoamino group via Aminolink™. II 
available from Applied Biosystems, Foster City, Calif.) Stabinsky, U.S. Pat. No. 
4,739,044 (3' aminoalkylphosphoryl group); Agrawal et al., Tetrahedron Letters, 31: 

1 5 1 543- 1 546 (1 990) (attachment via phosphoramidate linkages); Sproat et al. , Nucleic 
Acids Research, 15: 4837 (1987) (5' mercapto group); Nelson et al, Nucleic Acids 
Research, 17: 7187-7194 (1989) (3* amino group); and the like. 

With reference to Figure 2, the preferred linkers to several bases of various 
dNTP structures of the present invention are shown. Again, DABCYL is a wavelength- 

20 independent fluorescence quencher having quenching efficiencies in the range required 
by the present methods. Better quenching permits higher dNTP concentrations and faster 
turnover rates. In certain aspects, the quencher is linked to the same nucleobase sites 
typically employed for attaching dyes {see, Figure 1). As the polymerase progresses 
along the DNA, the quencher will remain at every incorporated base. In certain aspects, 

25 the quencher is covalently attached to a dNTPs using the C5 linker shown in Figure I . In 
certain embodiments, the quencher moiety is attached to the fluorophore moiety via a 
linker. In certain other embodiments, the quencher can be attached to the sugar of the 
dNTPs. 

In general, nucleoside labeling can be accomplished using any of a large 
30 number of known nucleoside labeling techniques using known linkages, linking groups, 
and associated complementary functionalities. The linkage linking the quencher moiety 
and nucleoside should be compatible with relevant polymerases and not quench the 
fluorescence of the fluorophore moiety. 
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Preferably, the quenchers are covalently linked to the 5 -carbon of 
pyrimidine bases and to the 7-carbon of 7-deazapurine bases. Several suitable base 
labeling procedures have been reported that can be used with the present invention, e.g. 
Gibson et al. , Nucleic Acids Research, 15: 6455-6467 (1987); Gebeyehu et al. , Nucleic 
5 Acids Research, 15: 45 1 3-4535 ( 1 987); Haralambidis et al. , Nucleic Acids Research, 15: 
4856-4876 (1987); Nelson et al, Nucleosides and Nucleotides, 5(3) 233-241 (1986); 
Bergstrom, etal.,JACS, 111, 374-375 (1989); U.S. Pat. Nos. 4,855,225, 5,231,191, and 
5,449,767, each of which is incorporated herein by reference. Preferably , the linkages 
are acetylenic amido or alkenic amido linkages, the linkage between the quencher and the 
1 0 nucleotide base being formed by reacting an activated N-hydroxysuccinimide (NHS) ester 
of the dye with an alkynylamino- or alkenylamino-derivatized base of a nucleotide. More 
preferably, the resulting linkages are shown in Figure 2. 

The synthesis of alkynylamino-derivatized nucleosides is taught by Hobbs 
et al. in European Patent Application No. 87305844.0, and Hobbs et al.,J. Org. Chem., 
15 54: 3420 (1989), which is incorporated herein by reference. As taught therein, the 
alkynylamino-derivatized nucleotides are formed by placing the appropriate 
halcdeoxynucieoside (usually 5-iodopyrimidine and 7-iodo-7-deazapurine 
deoxynucleosides and Cu(I) in a flask, flushing with argon to remove air, adding dry 
DMF, followed by addition of an alkynylamine, triethyl-amine and Pd(0). The reaction 
20 mixture can be stirred for several hours, or until thin layer chromatography indicates 
consumption of the halodeoxynucleoside. 

As taught in U.S. Patent No. 5,047,519, which issued to Hobbs et al. on 
September 10, 1991, the alkynylamino linkers have the structure: 

Nuc — C=C — R 1 NR 2 R 3 

25 wherein R 1 is a substituted or unsubstituted diradical moiety of 1 -20 atoms. Nuc is a 
purine or pyrimidine base. R 1 can be straight-chained alkylene, C\ -C 20, optionally 
containing within the chain double bonds, triple bonds, aryl groups or heteroatoms such 
as N, O or S. The heteroatoms can be part of such functional groups as ethers, thioethers, 
esters, amines or amides. Preferably, R 1 is straight-chained alkylene, C f -C20; most 

30 preferably R 1 is CH2. Substituents on R 1 can include Ci -C6 alkyl, aryl, ester, ether, 
amine, amide or chloro groups. R 2 and R 3 are independently H, alkyl, Ci -C 4> or a 
protecting group such as acyl, alkoxycarbonyl, a fluorophore, a quencher or sulfonyl. 
Preferably R 2 is H, and R 3 is a quencher. The alkynylamino linker is preferably attached 
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to the 5-position of the pyrimidine nucleotides and the 7 position of the purine 
nucleotides. 

In a preferred embodiment, a quencher-sulfoNHS derivative e.g., 
DABCYL-sulfoNHS is prepared using DABCYL and reacting it with N- 
5 hydroxysulfosuccinimide and N,N'-dicyclohexylcarbodiimide under a nitrogen 
atmosphere (see, Figure 1). The DABCYL-sulfoNHS derivative is reacted with an 
aminoallyl-dNTP e.g. aminoallyl-dUTP, to produce the DABCYL-dUTP. Using the 
DABCYL-dUTP and cystamine dihydrochloride, a DABCYL-dUTP-thiol derivative can 
be generated. Thereafter, a NTP having a y-phosphate fluorophore attached can be 
1 0 produced by reacting for example the DABCYL-dUTP-thiol with BODIPY TR- 

iodoacetamide (commercially available from Molecular Probes D-601 1) to produce 
DABCYL-dUTP-BODIPY TR. 

C. Quenching Efficiency 

The present invention provides NTP molecules having a y-phosphate with 
1 5 a fluorophore moiety attached thereto. The fluorophore moiety exists quenched with at 
least about a 5 fold quenching efficiency when the y-phosphate is attached to the NTP and 
is unquenched i.e., is fluorescent, when the y-phosphate is detached from the NTP. 
Preferably, the fluorophore moiety exists quenched with at least about a 3 fold quenching 
efficiency to about 100 fold quenching efficiency. In a more preferred embodiment, the 
20 fluorophore moiety exists quenched with at least about a 100 fold quenching efficiency to 
about a 1000 fold quenching efficiency. 

The quenching efficiency of the NTPs of the present invention is a routine 
parameter easily determined. As will be apparent to those of skill in the art, quenching 
efficiency can be measured in a fluorometer optionally having laser excitation. Similar to 
25 the earlier discussion of the Stern-Volmer equation, quenching efficiency is equal to 

F 0 -F/ F 0 

wherein F 0 is fluorescence of the NTP without quenching and F is the quenched 
fluorescence. Since there is no certain way to eliminate all fluorescence in a sample of 
quenched NTP, the unquenched measurements, F Q , can be taken in a separate sample 
30 containing dye alone and the quenched measurements, F, can be made on the same 
concentration of quenched dNTP. 

The compounds of the present invention have at least 3 fold quenching 
efficiency. A fully fluorescent dye has a F 0 value of 1 , whereas a dye quenched by 90% 
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has an F value of 0. 100. A compound quenched by 90%, has a quenching efficiency of 
0.9 or is 10 fold quenched. Therefore, for compounds of the present invention, F is 
characterized as follows: 0.670 < F< 0.999, i.e., the compounds possess quenching 
efficiencies between about 3 fold to about 1 000 fold. Preferably the quenching efficiency 
5 of a compound of the present invention is about at least 5 fold to about 1000 fold, and 
more preferably, the quenching efficiency is about at least 10 fold to about 1000 fold. 

In the present invention, detection of pyrophosphate depends on generating 
a fluorescent signal by dequenching, or turning on, a quenched fluorescent dye in 
response to pyrophosphate. Efficient quenching provides a lower background 

1 0 fluorescence, enhancing the signal-to-noise ratio upon dequenching by pyrophosphate. 
Incomplete quenching results in a low level fluorescence background from each dye 
molecule. Additional background fluorescence is contributed by a few of the dye 
molecules that are fully fluorescent because of accidental (i.e., pyrophosphate- 
independent) dequenching, for example by breakage of a bond connecting the dye to the 

15 quencher moiety. Thus, the background fluorescence has two components: a low-level 
fluorescence from all dye molecules, referred to herein as "distributed fluorescence 
background" and full-strength fluorescence from a few molecules, referred to herein as 
"localized fluorescence background". 

In instances where a multi-labeling scheme is utilized, a wavelength which 

20 approximates the mean of the various candidate labels' absorption maxima may be used. 
Alternatively, multiple excitations may be performed, each using a wavelength 
corresponding to the absorption maximum of a specific label. Table I lists examples of 
various types of fluorophores and their corresponding absorption maxima. 
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TABLE I 



Candidate Fluorophores 


Absorbance/Emission 


Rhol23 


507/529 


R6G 


528/551 


BODIPY 576/589 


576/589 


BODIPY TR 


588/616 


Nile Blue 


627/660 


BODIPY 650/665 


650/665 


Sulfo-IRD700 


680/705 


NN382 


778/806 


Tetramethylrhodamine 


550 


Rodamine X 


575 


Cy3 TM 


550 


Cy5 TM 


650 


Cy7 TM 


750 



5. Miscellaneous reaction reagents. 
5 The polymerase selected for use in this invention is not critical. Preferred 

polymerases are able to tolerate labels both on the nucleobase and on the gamma- 
phosphate. The polymerase should have a fidelity (incorporation accuracy) of at least 
99% and a processivity (number of nucleotides incorporated before the enzyme 
dissociates from the DNA) of at least 20 nucleotides, with greater processivity preferred. 
10 Examples include T7 DNA polymerase, T5 DNA polymerase, HIV reverse transcriptase, 
E. coli DNA pot I, T4 DNA polymerase, T7 RNA polymerase, Taq DNA polymerase and 
E. coli RNA polymerase. Exonuclease-defective versions of these polymerases are 
preferred. 

The primers (DNA polymerase) or promoters (RNA polymerase) are 
15 synthetically made using conventional nucleic acid synthesis technology. The 

complementary strands of the probes are conveniently synthesized on an automated DNA 
synthesizer, e.g. an Applied Biosystems, Inc. (Foster City, Calif.) model 392 or 394 
DN A/RNA Synthesizer, using standard chemistries, such as phosphoramidite chemistry, 
e.g. disclosed in the following references: Beaucage and Iyer, Tetrahedron, 48: 2223- 

15 
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231 1 (1992); Molko et al, U.S. Pat. No. 4,980,460; Koster et al, U.S. Pat. No. 4,725,677; 
Caruthers et al, U.S. Pat. Nos. 4,415,732; 4,458,066; and 4,973,679; and the like. 
Alternative chemistries, e.g. resulting in non-natural backbone groups, such as 
phosphorothioate, phosphoramidate, and the like, may also be employed provided that the 
5 resulting oligonucleotides are compatible with the polymerase. They can be ordered 
commercially from a variety of companies which specialize in custom oligonucleotides 
such as a Operon Inc. (Alameda, California). 

Primers in combination with polymerases are used to sequence target 
DNA. Primer length is selected to provide for hybridization to complementary template 
10 DNA The primers will generally be at least 10 bp in length, usually at least between 15 
and 30 bp in length. Primers are designed to hybridize to known internal sites on the 
subject target DNA. Alternatively, the primers can bind to synthetic oligonucleotide 
adaptors joined to the ends of target DNA by a ligase. Similarly where promoters are 
used, they can be internal to the target DNA or ligated as adaptors to the ends. 

15 6. Reaction conditions. 

The reaction mixture for the sequencing comprises an aqueous buffer 
medium which is optimized for the particular polymerase. In general, the buffer includes 
a source of monovalent ions, a source of divalent cations and a buffering agent. Any 
convenient source of monovalent ions, such as KC1, K-acetate, NFLt-acetate, K-glutamate, 

20 NH4CI, ammonium sulfate, and the like may be employed, where the amount of 
monovalent ion source present in the buffer will typically be present in an amount 
sufficient to provide for a conductivity in a range from about 500 to 20,000, usually from 
about 1000 to 10,000, and more usually from about 3,000 to 6,000 micromhos. 

The divalent cation may be magnesium, managanese, zinc and the like, 

25 where the cation will typically be magnesium. Any convenient source of magnesium 

cation may be employed, including MgCl 2 , Mg-acetate, and the like. The amount of Mg 
ion present in the buffer may range from 0.5 to 20 mM, but will preferably range from 
about 1 to 12mM, more preferably from 2 to lOmM and will ideally be about 5mM. 

Representative buffering agents or salts that may be present in the buffer 

30 include Tris, Tricine, HEPES, MOPS and the like, where the amount of buffering agent 
will typically range from about 5 to 150 mM, usually from about 10 to 100 mM, and 
more usually from about 20 to 50 mM, where in certain preferred embodiments the 
buffering agent will be present in an amount sufficient to provide a pH ranging from 
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about 6.0 to 9.5, where most preferred is pH 7.6 at 25° C. Other agents which may be 
present in the buffer medium include chelating agents, such as EDTA, EGTA and the 
like. 

7. Sample Housing. 

5 The solid support is optionally housed in a flow chamber having an inlet 

and outlet to allow for renewal of reactants which flow past the immobilized moieties. 
The flow chamber can be made of plastic or glass and should either be open or 
transparent in the plane viewed by the microscope or optical reader. Electro-osmotic flow 
requires a fixed charge on the solid support and a voltage gradient (current) passing 
1 0 between two electrodes placed at opposing ends of the solid support. The flow chamber 
can be divided into multiple channels for separate sequencing. Examples of micro flow 
chambers exist. For example, Fu et al. {Nat. Biotechnol. (1999) 17:1 109) describe a 
microfabricated fluorescense-activated cell sorter with 3 urn x 4um channels that utilizes 
electro-osmotic flow for sorting. 
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8. Detection of fluorophores. 

This invention requires the imaging of single molecules in a solution. 
There are a variety of known ways of achieving this goal. General reviews are available 
describing this technology. Reviews include Basche et. al., eds., 1996, Single molecule 
5 optical detection, imaging, and spectroscopy, Weinheim:VCM, and Plakhotnik, et. al., 
Single-molecule spectroscopy, Ann. Rev. Phys, Chem. 48: 181-212. In general, the 
methods involve detection of laser activated fluorescence using microscope equipped 
with a camera. It is sometimes referred to as a high-efficiency photon detection system. 
Nie, et. al., 1994, Probing individual molecules with confocal fluorescence microscopy, 

1 0 Science 266: 1018-101 9. The detection of single molecules involves limiting the detection 
to a field of view in which one has a statistical reason to believe there is only one 
molecule (homogeneous assays) or to a field of view in which there is only one actual 
point of attachment (heterogeneous assays). The single-molecule fluorescence detection 
of the present invention can be practiced using optical setups including near-field 

1 5 scanning microscopy, far-field confocal microscopy, wide-field epi-illumination, and 
total internal reflection fluorescence (TIRF) microscopy. For two-dimensional imaging 
fluorescence detection, the microscope is typically a total internal reflectance microscope. 
Vale et. al., 1996, Direct observation of single kinesin molecules moving along 
microtubules, Nature 380: 451, Xu and Yeung 1997, Direct Measurement of Single- 

20 Molecule Diffusion and Photodecomposition in Free Solution, Science 215: 1 106-1 109. 

Suitable photon detectors include, but are not limited to, photodiodes and 
intensified CCD cameras. In a preferred embodiment an intensified charge couple device 
(ICCD) camera is used. The use of a ICCD camera to image individual fluorescent dye 
molecules in a fluid near the surface of the glass slide is advantageous for several reasons. 

25 With an ICCD optical setup, it is possible to acquire a sequence of images (movies) of 
fluorophores. In certain aspects, each of the NTPs of the present invention has a unique 
fluorophore associated with it, as such, a four-color instrument can be used having four 
cameras and four exitation lasers. Thus, it is possible to use this optical setup to sequence 
DNA. In addition, many different DNA molecules spread on a microscope slide can be 

30 imaged and sequenced simultaneously. Moreover, with the use of image analysis 

algorithms, it is possible to track the path of single dyes and distinguish them from fixed 
background fluorescence and from "accidentally dequenched" dyes moving into the field 
of view from an origin upstream. 
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In certain aspects, the preferred geometry for ICCD detection of single- 
molecules is total internal reflectance fluorescence (TIRF) microscopy. In TIRP, a laser 
beam totally reflects at a glass- water interface. The optical field does not end abruptly at 
the reflective interface, but its intensity falls off exponentially with distance. The thin 
5 "evanescent" optical field at the interface provides low background and enables the 
detection of single molecules with signal-to-noise ratios of 12:1 at visible wavelengths 
(see, M. Tokunaga et al, Biochem. and Biophys. Res, Comm. 235, 47 (1997) and P. 
Ambrose, Cytometry, 36, 244 (1999)). 

The penetration of the field beyond the glass depends on the wavelength 

10 and the laser beam angle of incidence. Deeper penetrance is obtained for longer 

wavelengths and for smaller angles to the surface normal within the limit of a critical 
angle. In typical assays, fluorophores are detected within about 200 nm from the surface 
which corresponds to the contour length of about 600 base pairs of DNA. Preferably, a 
prism-type TIRF geometry for single-molecule imaging as described by Xu and Yeung is 

15 used (see, X-H.N. Xu et al, Science, 281, 1650 (1998)). 

Single molecule detection can be achieved using flow cytometry where 
flowing samples are passed through a focused laser with a spatial filter used to define a 
small volume. US Pat. No. 4,979,824 describes a device for this purpose. US Pat. No. 
4,793,705 describes and claims in detail a detection system for identifying individual 

20 molecules in a flow train of the particles in a flow cell. The '705 patent further describes 
methods of arranging a plurality of lasers, filters and detectors for detecting different 
fluorescent nucleic acid base-specific labels. US Pat. No. 4,962,037 also describes a 
method for detecting an ordered train of labeled nucleotides for obtaining DNA and RNA 
sequences using a nuclease to cleave the bases rather than a polymerase to synthesize as 

25 described herein. Single molecule detection on solid supports is described in Ishikawa, et 
al. (1994) Single-molecule detection by laser-induced fluorescence technique with a 
position-sensitive photon-counting apparatus, Jan. J. Apple. Phys. 33:1571-1576. 
Ishikawa describes a typical apparatus involving a photon-counting camera system 
attached to a fluorescence microscope. Lee et al. (1994), Laser-induced fluorescence 

30 detection of a single molecule in a capillary, Anal. Chem., 66:4142-4149 describes an 

apparatus for detecting single molecules in a quartz capillary tube. The selection of lasers 
is dependent on the label and the quality of light required. Diode, helium neon, argon ion, 
argon-krypton mixed ion, and Nd:YAG lasers are useful in this invention. 
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This invention can be viewed as a system of components designed to 
detect the synthesis of a nucleic acid as PPi is released. Figure 4 provides an overview of 
this system which is described in detail in Example 2. Figure 5 illustrates a typical 
arrangement of the polymerase (14) immobilized on the surface of the solid support (15) 
5 with a target DNA (13) being extended by the incorporation of gamma labeled dNTPs 
(17). More specifically, the target DNA is hybridized to a primer that will become the 
extension product generated by the tethered DNA dependent DNA polymerase. 

Figure 6 illustrates a continuous sequences of double stranded DNA 
molecules (18) using double stranded DNA dependent RNA polymerase (19) 
) immobilized on a solid support (15) where the extension product is transcribed as a new 
RNA (20) from the labelled rNTPs (21). More specifically, a double stranded DNA 
template binds to the RNA polymerase through sequence recognition of a polymerase 
promoter sequence. The sequence is read as the DNA travels through the enzyme 
yielding a nascent RNA. After the RNA and DNA are released, the enzyme is ready to 
5 sequence the next DNA molecule passing by. 

These schemes ensure that the active site remains in the evanescent light 
field at the surface, so that every dye released from a quenched dNTP is illuminated. 
Tethering of the polymerase, rather than the target nucleic acid (template) is convenient 
because it provides for a continuous sequencing process where one immobilized enzyme 
) sequences many different DNA molecules. 

All publications and patent applications cited in this specification are herein 
incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. 

Although the foregoing invention has been described in some detail by way 
5 of illustration and example for purposes of clarity of understanding, it will be readily 
apparent to those of ordinary skill in the art in light of the teachings of this invention that 
certain changes and modifications may be made thereto without departing from the spirit or 
scope of the appended claims. 

EXAMPLES 

0 The following examples are provided by way of illustration only and not by 

way of limitation. Those of skill will readily recognize a variety of noncritical parameters 
which could be changed or modified to yield essentially similar results. 
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EXAMPLE 1. POLYMERASE ARRAY 

This example shows how to fabricate an array of nickel nanodots and 
attach a polymerase to each dot. The required equipment includes a spinner (PWM 202 
E-beam resist spinner, Headway Research Inc.), an evaporator (SC4500 thermal e-gun 
5 evaporator, CVC Products Inc.), and a scanning electron microscope (Leo 982 with 
Nabity pattern generator, Leo Electron Microscopy Inc.). 

Clean a 25 mm diameter microscope coverslip on the spinner by spraying 
alternately with acetone and isopropyl alcohol (IP A) and spinning the last IPA film until 
dry. Coat the coverslip in the spinner with 0.5 ml of PMMA (poly(methyl 
10 methylacrylate), MW 496 kDa, 2% in chlorobenzene), bake on a hoplate at 170 C for 10 
min, coat with 0.5 ml of PMMA (MW 950 kDa, 2% in methyl isobutyl ketone [MIBK]), 
and bake again. Apply the conductive layer by evaporating 1 00 Angstroms of gold onto 
the PMMA film in the CVC SC4500. Use the electron microscope to etch the array 
pattern into the PMMA film using a pattern generator on the Leo 982 as specified by a 
15 CAD drawing (Design CAD, 50 run spots, 10 urn center-to-center spacing, 200 x 200 dot 
array). 

Remove the gold layer by placing the exposed coverslip in Gold Etch (15- 
1 7% sodium iodide) for 7 seconds followed by rinsing with IPA and water. Deposit 
Tantalum (50 Angstroms) and Nickel (1 00 Angstroms) on the coverslip in the CVC 

20 SC4500. Remove the PMMA in a 1:1 mix of acetone and methylene chloride for 10-15 
min followed by sonication for several seconds and rinsing with IPA and water. 

Attach the polymerase just before use by applying 10 ul of a 15 nM 
solution of polyhistidine-tagged Klenow DNA polymerase exo" (prepared using TOPO 
cloning vector and ProBond Resin, Invitrogen Inc.) in phosphate-buffered saline (PBS; 

25 Harlow E., Lane D. 1988. Antibodies A Laboratory Manual. Cold Spring Harbor 

Laboratory ISBN 0-87969-14-2) to the coverslip; after 20 min, wash the coverslip in PBS 
and use immediately. 

EXAMPLE 2. TWO-COLOR SINGLE-MOLECULE IMAGING MICROSCOPE 

The microscope described in Figure 4 is used to detect single molecule 
30 events on the surface of the coverslip obtained in Example 1 . The microscope is fitted 
with a multicolor mixed-gas laser (1) which emits light at tunable wavelengths. The 
laser beam is first passed through a laser line filter (10) in order to filter out undesirable 
wavelengths. The unfocused output passes to a fused-silica right angle prism (2), which 
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is located directly under the cover slip (3). The sample (4) molecules sit in a buffer 
solution on the cover slip. 

Laser light entering the prism is refracted at an angle such that the critical 
angle between fused-silica and the buffer solution is exceeded. The light is thus 
5 completely reflected within the prism, giving rise to a total internal reflection (TIR) 
evanescent field (9) in the buffer. The area of interest is imaged using a microscope (6) 
with an objective lens (5) immersed directly in the buffer. Fluorescence emission at the 
microscope output passes through a multi-wavelength viewer (7), which spatially 
separates images at different wavelengths. This allows the discrimination of events 
1 0 occurring at different wavelengths. 

The images are projected onto the array of an intensified CCD (ICCD) 
camera (8), from which they are digitized and stored in memory (11). Image analysis 
algorithms (12) are performed on the images stored in memory. These algorithms may 
distinguish signal from background, track molecules, and perform other signal processing 
1 5 functions. The memory and signal processing may be performed off-line in a computer, 
or in specialized digital signal processing (DSP) chips controlled by a microprocessor. 
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EXAMPLE 3. SYNTHESIS OF DUAL-LABELED NUCLEOTIDES 



Example 3A. The synthesis of DABCYL-dUTP-BODIPY TR. (Figure 3). 

a. Preparation of D ABC YL-sulfoNHS 

DABCYL (108 mg, 0.4 mmole; Aldrich 25,019-8 Methyl red) was 
5 dissolved in a mixture of dry N,N-dimethylformamide (10 mL; Aldrich 22,705-6) and dry 
pyridine (96 mg, 1.2 mmole; Aldrich 27,097-0). N-Hydroxysulfosuccinimide (260 mg, 
1.2 mmole; Pierce 24510) and N,N'-dicyclohexylcarbodiimide (250 mg, 1.2 mmole; 
Pierce 20320) were added and the mixture was stirred at 50°C for 2 hours under a 
nitrogen atmosphere. The reaction was monitored by TLC (MKC 1 8F Reversed Phase; 
10 Whatman 4803-1 10; developed in 0. 1 M triethylammonium acetate, pH 7, 80% 

acetonitrile). After dilution with ether, the supernatant was decanted, the product was 
washed with ether on a filter, dried, and stored dessicated at -20°C. 

b. Synthesis of DABC YL-dUTP 

Aminoallyl-dUTP (10 mg, 20 umole; Sigma A 0410) was mixed with 
15 DABCYL-sulfoNHS (30 mg, 30 umole; from step A) in 3 mL of 0.1 M sodium carbonate 
pH 8.3. The mixture was incubated in the dark for 4 hours at room temperature and the 
reaction was monitored by TLC (as in step A). The DABCYL-dUTP product was 
purified by reversed-phase HPLC using a linear gradient from 0% to 100% of Buffer B 
mixed into Buffer A over 20 minutes (Buffer A is 0.1 M triethylammonium acetate in 
20 water, pH 7, 4% acetonitrile; Buffer B is the same as Buffer A with 80% acetonitrile). 

c. Synthesis of DABCYL-dUTP-thiol 

DABCYL-dUTP (9 mg, 12 umole; from step B) was dissolved in lmL of 
0.1 M MES pH 5.7 (Sigma M 3023) and adjusted to pH 5.75. Cystamine dihydrochloride 
(10 mg, 44 umol; Sigma C 8707) was dissolved in 2.5 mL of 0.1 M MES pH 5.7 and 

25 adjusted to pH 5.75. EDC (9 mg, 47 umol; Pierce 22980) was dissolved in 0.5 mL of 0. 1 
M MES pH 5.7 and was added immediately to the DABCYL-dUTP solution. After 10 
minutes, the cystamine solution was added and the pH was maintained between 5.5 and 
5.8 while the reaction proceeded at room temperature. After two hours, the pH was 
adjusted to 7.0 and the sample was stored at -20°C. The product was purified by 

30 reversed-phase HPLC as in step b. 
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d. Synthesis of DABCYL-dUTP-BODIPY TR 

DABCYL-dUTP-thiol (2.5 mg, 3 \xmo\e; from step C) was dissolved in 
5.4 mL of 5 raM TCEP (Pierce 20490), 30 mM sodium phosphate adjusted to pH 7.5. 
BODIPY TR-iodoacetamide (5mg, 7.4 umol; Molecular Probes D-601 1) was dissolved in 
5 2.6 mL of N,N-dimethylformamide and was added to the DABCYL-dUTP-thiol solution. 
After standing at room temperature in the dark for 5 hours, the product was purified by 
reversed-phase HPLC as in step b. 

e. Determination of Quenching Efficiency 

The quenching efficiency of DABCYL-dUTP-BODIPY TR was 
10 determined as follows. First, the fluorescence of a sample containing the dye BODIPY 
TR is measured. Second, a sample containing the same concentration of the nucleotide 
triphosphate having a y-phosphate with a fluorophore moiety attached i.e., DABCYL- 
dUTP-BODIPY TR is measured. Thereafter, the quenching efficiency, which is equal to 
F 0 -F/ F 0 wherein F 0 is fluorescence of the BODIPY TR alone and F is the fluorescence of 
15 DABCYL-dUTP-BODIPY TR is calculated. The fluorescence quenching efficiency of 
DABCYL-dUTP-BODIPY TR is at least 5 fold compared to the BODIPY TR alone. 

Example 3B. Synthesis of Rho-dUTP-BODIPY TR and Rho-dCTP-BODIPY TR 

a. Synthesis ofRho-dUTP 

Aminoallyl-dUTP (20 umole; Sigma A 0410) was mixed with 5- 
20 carboxyrhodamine 6G, succinimidyl ester (30 umole; Molecular Probes C-6127) in 3 ml 
of 0.1 M sodium carbonate pH 8.3, 20% DMF. The mixture was incubated in the dark at 
room temperature and the reaction was monitored by TLC (MKC1 8F Reversed Phase; 
Whatman 4803-1 10; developed in 0.1 M triethylammonium acetate, pH 7, 80% 
acetonitrile). The Rho-dUTP product was purified by reversed-phase HPLC using a 
25 linear gradient from 0% to 100% of Buffer B mixed into Buffer A over 20 minutes 

(Buffer A is 0.1 M triethylammonium acetate in water, pH 7, 4% acetonitrile; Buffer B is 
the same as Buffer A with 80% acetonitrile). 

b. Synthesis of Rho-dUTP-thiol 

Rho-dUTP (12 umole; from step a) was dissolved in 1ml of 0.1 M MES 
30 pH 5.7 (Sigma M 3023) and adjusted to pH 5.75. Cystamine dihydrochloride (44 umol; 
Sigma C 8707) was dissolved in 2.5 ml of 0.1 M MES pH 5.7 and adjusted to pH 5.75. 
EDC (47 umol; Pierce 22980) was dissolved in 0.5 ml of 0.1 M MES pH 5.7 and was 
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added immediately to the Rho-dUTP solution. After 10 minutes, the cystamine solution 
was added and the pH was maintained between 5.5 and 5.8 while the reaction proceeded 
at room temperature. After two hours, the pH was adjusted to 7.0 and the sample was 
stored at -20 C. The product was purified by reversed-phase HPLC as described in step 
5 a. 

c Synthesis of Rho-dUTP-BQDIP Y TR 

Rho-dUTP-thiol (3 umole; from step b) was dissolved in 5.4 ml of 5 mM 
TCEP (Pierce 20490), 30 mM sodium phosphate adjusted to pH 7.5. BODIPY TR- 
iodoacetamide (7.4 umol; Molecular Probes D-601 1) was dissolved in 2.6 ml of N,N- 
10 dimethylformamide and was added to the Rho-dUTP-thiol solution. After standing at 
room temperature in the dark for 5 hours, the product was purified by reversed-phase 
HPLC as in step a. 

d. Synthesis of Rho-dCTP-BQDIPY TR 

Labeled dCTP was made essentially as described for dUTP, except that 
15 aminoallyl-dCTP was substituted for the dUTP. Aminoallyl-dCTP was synthesized 
according to U.S. Patent 5,476,928 except that dCTP was used instead of dUTP. 

EXAMPLE 4. GENOTYPING 

This example illustrates a sequence-based genotyping assay on single 
DNA molecules. The target is the (Delta)F508 deletion of the cystic fibrosis 

20 transmembrane conductance regulator gene (Welsh MJ, et al., 1993. J. Cell Science 
106S:235-239). Genomic DNA is isolated from whole blood of CFTR-homozygous 
individuals (Wizard Genomic DNA Purification Kit Cat. No. Al 120, Promega Inc.). 
Amplify a 54-nucleotide segment of the CFTR gene by PCR using the primers 5'- 
CACCATTAAAGAAAATATCAT (primer 1 Seq. ID. No 1) and 5'-Biotinyl- 

25 CTGTATCTATATTCATCATAG (primer 2 Seq. ID No. 2). 

The custom primers are obtained from a commercial source (e.g., Operon 
Technologies, Alameda, CA). Primer 1 hybridizes to the CFTR sequence adjacent to the 
(Delta)F508 deletion; the first nucleotide 3' to primer 1 is cytosine in the normal allele or 
thymine in the deletion mutant (see Table 2 below; from Vaughan P, McCarthy T. 1998. 

30 Nuc Acids Res 26: 8 1 0-8 1 5). Amplification conditions are 200 ng of genomic DNA, 0.2 
mM each of dATP, dCTP, dGTP and dUTP, 0.2 uM of each primer, 2.0 mM MgC12, 1 
unit of Taq DNA polymerase, 50mM KC1, 10 mM Tris-HCl (pH 9 at 25 C), 0.1% Triton 
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X-100, in a final volume of 50 ul (30 cycles of 94 C for 30 sec, 52 C for 30 sec, 72 C for 
15 sec; optimization of Mg concentration and cycling parameters may be necessary, 
depending on the particular characteristics of the thermal cycler device employed). 

Purify the PCR product on streptavidin-coated magnetic beads 
5 (DYNAPURE Dye Terminator Removal kit, Dynal A.S.) and re-amplify for 15 cycles 
under the same conditions as before, except replace the genomic DNA with 2 ul of the 
purified amplification product and omit primer 1 in order to produce an excess of the 
single-stranded DNA product complementary to primer 1 . Remove the dNTPs from the 
amplified product ("template-primer") using magnetic beads as before. 

10 Place 10 ul of reaction mix (30 nM template-primer, 100 nM primerl, 1 

nM Rho-dCTP-BTR (Example 3), 10 mM Tris-HCl (pH 7.5), 5 mM MgCl 2 , 7.5 mM 
dithiothreitol) onto the polymerase array on the two-color single-molecule imaging 
microscope of Example 2 and acquire a series of images for 20 minutes. Similarly 
analyze a second sample that is identical to the first except that Rho-dCTP-BTR of the 

15 first sample is replaced with Rho-dUTP-BTR (Example 3). Analyze the image sets for 
incorporation events as indicated by separation of the two coupled dyes from one another. 
Compare the number of events observed using dCTP (sample 1) and dUTP (sample 2) to 
determine whether the test CFTR gene is the normal or deletion allele. 

Table 2 

20 Normal Allele 

CACCATTAAAGAAAATATCAT CUUUGGUGUUUCCUAUGAUGAAUAUAGAUA 
CAG Seq Id. No. 3 

GUOGUAAUUUCUUUUAUAGUAGAAACCACAAAG GATACTACTTATATCTAT 
GTC 

25 Deletion Mutant 

CACCATTAAAGAAAATATCAT UGGUGUUUCCUAUGAUGAAUAUAGAUACAG 
Seq ID. No. 4. 

GUGGUAAUUUCUUUUAUAGUAACCACAAAG GATACTACTTATATCTATGTC 

The primers are underlined (primer 1 at left, primer 2 at right). The three deleted base 
30 pairs are indicated in bold type in the normal sequence. 
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EXAMPLE 5. SEQUENCING 

Sequencing is accomplished as for genotyping (Example 4), with reading 
of more than one nucleotide downstream of the primer. 

a) Synthesis of BODIPY TR-dUTP-Rhodamine 
5 Follow the procedure of Example 3a to make BODIPY TR-dUTP, 

substituting BODIPY TR-X STP ester (Molecular Probes B- 10003) for the rhodamine of 
Example 3 a. Derivatize the BODIPY TR-dUTP with a thiol as in Example 3 b, and 
conjugate tetramethylrhodamine-5-iodoacetamide dihydroiodide (Molecular Probes T- 
6006) to the thiol as in Example 3c. 

10 b) Sequencing 

Use the same normal allele DNA template as prepared in Example 4. 
Place 10 ul of reaction mix (30 nM template-primer, 100 nM primerl, 1 nM BODIPY 
TR-dUTP-Rho (Example 5a), InM Rho-dCTP-BODIPY TR (Example 3d), 10 mM Tris- 
HC1 (pH 7.5), 5 mM MgCh, 7.5 mM dithiothreitol) onto the polymerase array on the two- 

15 color single-molecule imaging microscope of Example 2 and acquire an image series. 
Incorporation of dUTP is seen by separation of the two dyes, with the rhodamine being 
released. Incorporation of dCTP is detected by separation of the same two dyes, but with 
the Rhodamine being released instead of the BODIPY. 
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1 1 . 1 . A method of genotyping or sequencing a target nucleic acid 

2 comprising the steps of: 

3 i. immobilizing a nucleic acid polymerase or the target 

4 nucleic acid onto a solid support in a single molecule configuration; 

5 ii. contacting the solid support with a solution containing: 

6 (a) a target nucleic acid where the polymerase is 

7 immobilized or a polymerase where the target nucleic acid is immobilized; 

8 (b) a primer nucleic acid which complements a region of 

9 the target nucleic acid downstream of the region to be sequenced; 

10 (c) nucleotide triphosphates [NTP] where each type of base 

11 is differently labeled on the phosphate portion, where the labels provide a 

12 unique signal that is selectively detectable upon incorporation of NTP into 

13 the polymerase extension product; 

14 iii. permitting the polymerase to sequentially extend the primer 

15 incorporating the NTP creating a complement to the target nucleic acid; and, 

16 iv. determining extension of the primer by detecting the unique 

17 signal from the labeled NTP to genotype or to sequence the target nucleic acid. 

1 2. A method of claim 1 where the solution contains at least two 

2 different types of dNTP. 

1 3 . A method of claim 1 where the dNTPs are labeled on the gamma 

2 phosphate with a fluorescent label that differentially fluoresces when the gamma 

3 phosphate is cleaved away from the nucleoside. 

1 4. A method of claim 1 where the surface of the solid support 

2 comprises an array of locations each bearing a single molecule of polymerase or target 

3 DNA. 

1 5. A method of claim 1 where the polymerases are immobilized as an 

2 array of individual, single molecules onto a solid support. 

1 6. A method of claim 1 where the target DNA is a single nucleotide 

2 polymorphism. 



28 



WO 00/36152 



PCT/US99/29585 



1 7. A method of claim 6 wherein the solution comprises a single 

2 dNTP. 

1 8. A method of claim 1 where the detection is a sequential detection 

2 of different types of dNTP to provide the sequence of a target DNA. 

1 9. A method of claim 1 wherein the NTP is labeled with a fluorophore 

2 on the gamma phosphate and a quencher moiety. 

1 10. A method of claim 1 where the detector is an optical reader. 

1 1 1 . A method of claim 1 where the solution contacting the solid 

2 support flows past the immobilized polymerase or target DNA. 

1 12. A method claim 1 wherein the target nucleic acid is 

2 deoxynucleotide acid, the polymerase is either a DNA dependent DNA polymerase or a 

3 RNA dependent DNA polymerase and the NTPs are dNTPs. 

1 13. A method of claim 1 wherein the target nucleic acid is double 

2 stranded DNA and the polymerase is RNA polymerase and the NTPs are rNTPs. 

1 14. A system for genotyping or sequencing a target nucleic acid 

2 comprising: 

3 i. a solid support having a surface bearing an immobilized 

4 nucleic acid polymerase or the target nucleic acid in a single molecule 

5 configuration; 

6 ii. a solution for contacting the surface containing: 

7 (a) a target nucleic acid where the polymerase is 

8 immobilized or a polymerase where the target nucleic acid is immobilized; 

9 (b) a molar excess of nucleotide triphosphates [NTP], 

10 where each type of NTP base is differently labeled on the phosphate portion and 

1 1 where the labels provide a unique signal that is selectively detectable upon 

12 incorporation of the NTP into the polymerase extension product. 

1 15. A system of claim 14 wherein the polymerase is a DNA dependent 

2 DNA polymerase, the target nucleic acid is DNA and the NTPs are dNTPs said system 
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3 further comprising a primer nucleic acid which complements a region of the target 

4 nucleic downstream of the region to be sequenced. 

1 1 6. A system of claim 14 where the surface bears an immobilized 

2 DNA dependent DNA polymerase. 

1 1 7. A system of claim 14 wherein the solution comprises at least two 

2 different types of dNTP. 

1 1 8. A system of claim 14 where the surface of the solid support 

2 comprises an array of locations each bearing a single molecule of polymerase or target 

3 nucleic acid. 

1 19. A system of claim 14 where the surface of the solid support 

2 comprises an array of locations each bearing a single molecule of polymerase. 

1 20. A system of claim 14 where the NTPs are labelled on the gamma 

2 phosphate with a fluorescent label that differentially fluoresces when the gamma 

3 phosphate is cleaved away from the nucleoside. 

1 21 . A system of claim 14 where the target nucleic acid is DNA and 

2 where the DNA has a single nucleotide polymorphism from a wildtype DNA. 

1 22. A system of claim 14 wherein the solution comprises a single 

2 dNTP. 

1 23. A system of claim 14 wherein the polymerase is a RNA dependent 

2 DNA polymerase, the target nucleic acid is RNA, and the NTPs are dNTPs. 

1 24. A system of claim 14 wherein the polymerase is a DNA dependent 

2 RNA polymerase, the target nucleic acid is double stranded DNA and the NTPs are 

3 rNTPs. 

1 25. A system of claim 14 which further comprises an optical reader. 

1 26. A system of claim of claim 14 which further comprises electrodes 



2 where the electrodes are positioned to provide a voltage gradient across the solid support 



30 



WO 00/36152 PCT/US99/29585 

3 sufficient to permit flow of the solution contacting the immobilized polymerase or target 

4 nucleic acid. 

1 27. A solid support having a surface where the surface has a nucleic 

2 acid polymerase array attached to it wherein the members of the array consist of 

3 individual molecules of polymerase. 

1 28. A solid support of claim 25 wherein the members of the array are 

2 addressed. 

1 29. A solid support of claim 25 wherein the polymerase is a DNA 

2 dependent DNA polymerase. 

1 30. A solid support of claim 25 wherein the polymerase is a DNA 

2 dependent RNA polymerase. 
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SEQUENCE LISTING 
Seq. ID. No 1 

CACCATTAAAGAAAATATCAT 
Seq. ID No. 2 
5 Biotinyl-CTGTATCTATATTCATCATAG 
Seq. ID. No. 3 

CACCATTAAAGAAAATATCATCUUUGGUGUUUCCUAUGAUGAAUAUAGAUA 
CAG 

Seq. ID. No. 4. 

1 0 CACC ATTAAAGAAAATATCATUGGUGUUUCCUAUGAUGAAU AUAGAUAC AG 
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